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1 Introduction 



. The World Health Organization [WHO 95] defined the Quality of Life (QoL) as: 

CO 

u the individuals ' perceptions of their position in life in the context of their culture and the 



value systems in which they live, and in relation to their goals, expectations, standards and 
concerns. It is a broad-ranging concept affected in a complex way by the persons ' physical 
health, psychological state, level of independence, social relationships, personal beliefs, and 
their relationship to the salient features of their environment." 

In a survey about quality of life, Fayers and Machin [FAY 00] defined the Health related Quality of 
Life (HrQoL) as: 

"the way, which according to health of a person influences his/her capacity to lead on 
physical and social normal activities.'''' 

The "normality" of an activity is a variable concept, where it depends on the reference population. 
The use of indicators about HrQoL in clinical and epidemiological contexts is various: in primis these 
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indicators give additional information to evaluate the effect of care therapies on a patient. More and 
more, HrQoL is considered as a secondary endpoint of clinical trials, because the main focus is on the 
survival. However, in some contexts, such as that of pain therapies for terminal cancer patients, it 
rises to a primary endpoint. 

The main problem related to HrQoL concerns its measurement because this characteristic is not 
directly observable. Suitable measurement methods are then needed which are typically based on 
qualitative information, coming from ad hoc questionnaires, to be translated into quantitative infor- 
mation; see [MES 10] for a detailed illustration about statistical aspects involved in the measurement 
of HrQoL. In such a way it is possible to evaluate the patient's condition in relation to the general 
condition of the population so as to provide clinicians with information useful for the care and therapy 
decisional process. In this context, the Rasch model [RAS 61] represents an important tool to measure 
HrQoL. However, some important aspects must be taken into account. 

First of all, HrQoL is a latent multidimensional concept, because its proper evaluation requires 
consideration of several dimensions (corresponding to different latent traits) that reflect individual 
health conditions and how well patients are coping with the stress due to illness. Often, two macro- 
dimensions can be distinguished: a physical one and a psychological one; these are further separable 
in sub-dimensions, such as bodily pain and physical functioning in the former case, and mental health 
or vitality in the latter one. Usually, the latent traits corresponding to the different dimensions are 
highly correlated and are also correlated with other latent traits, such as those corresponding to some 
psychopathological disturbs, mainly anxiety and depression. However, the classic Rasch model is based 
on the assumption of unidimensionality. The easiest approach that is adopted when this assumption 
is not realistic consists of estimating separate Rasch models for subsets of items measuring different 
latent traits, but this method does not allow us to measure the correlation between these latent traits. 
A more suitable approach is based on in the multidimensional extension of Rasch models. 

The second aspect is that, in many applications, it is of interest to detect homogeneous classes of 
individuals who have very similar latent characteristics. Detecting these classes of individuals can be, 
not only more realistic, but also more convenient for the decisional process because individuals in the 
same class will receive the same clinical treatment. 

In order to analyse HrQoL data taking into account the above aspects, in this chapter we proposed 
the use of a version of the Rasch model which belongs to the class of multidimensional Item Response 
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Theory (IRT) models proposed by Bartolucci [BAR 07]. The model is characterized by two main 
features: (i) more latent traits are simultaneously considered {multidimensionality assumption); (ii) 
these latent traits are represented by a random vector having a discrete distribution common to all 
subjects (discreteness assumption). Each support point of this distribution identifies a different class 
of individuals. Obviously, these are latent classes, in the sense that we do not know to which class 
a given individual belongs; moreover, we do not know how many latent classes exist. The adopted 
model is then related to the latent class (LC) model (see [LAZ 68]; [GOO 74]) and for this reason will 
be referred to as the multidimensional LC Rasch model. 

We note that the LC model originates as a method to classify individuals on the basis of categorical 
responses, but, more recently, the same discrete latent structure on which this model is based has been 
exploited to account for the unobserved heterogeneity between subjects into other models. Using this 
structure can be considered as an alternative to the inclusion of continuous random effects which avoids 
a parametric specification of the distribution of these effects. A semi-parametric model then results. 
In particular, this structure has been used by [LIN 91] and [FOR 95] to define a unidimensional LC 
Rasch model, which is a particular case of the model that we here adopt. An alternative generalization 
of the Rasch model to the LC analysis is represented by the mixture Rasch model of Rost [ROS 90]. 
It may be seen as an extension of LC Rasch model, which allows for different sets of item parameters 
for each latent class. 

Another model which is strongly related to the multidimensional LC Rasch model is the LC factor 
model proposed by Magidson and Vermunt [MAG 01]. However, in the first approach each item 
response is affected by only one of the latent traits, and these latent traits may be correlated, whereas 
Magidson and Vermunt [MAG 01] assume that each item response may be simultaneously affected 
by two or more latent traits, which are mutually independent. We also have to mention alternative 
specifications of a multidimensional Rasch model that have been proposed; see, among others, the 
multidimensional marginally sufficient Rasch model proposed by Hardouin and Mesbah [HAR 04]. 

Aim of this chapter is also that of studying the correlation between the latent dimensions of HrQoL 
in cancer patients and those behind some psychopathological disturbs. This analysis involves two tests 
of dimensionality. The first is a likelihood ratio (LR) test which is based on the multidimensional LC 
Rasch model that exploits the discrete (or LC) marginal maximum likelihood (MML) approach. The 
second test is based on the Martin-L6f test (ML) approach [MAR 70] and exploits the conditional 
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maximum likelihood (CML) estimation method (see also [GLA 95b]). Alternative tests have been 
proposed by several authors; for a review see Verhelst [VER 01]. In particular, we mention the 
approach of Christensen et al. [CHR 02], who propose a test similar to the first test we here use, but 
it is based on the assumption that the latent traits follow a multivariate normal distribution. 

The remainder of this chapter is structured as follows. In Section 2 we describe the dataset used 
for the illustrative application. In Section 3 we present the multidimensional LC Rasch model of 
Bartolucci [BAR 07], with special attention to model assumptions and estimation from the practical 
point of view. In Section 4 we illustrate how to estimate the correlation between latent traits on the 
basis of the estimated parameters of this model. In the same section we describe the two tests of 
dimensionality based on the MML and CML approaches. Finally, in Section 5 we present the results 
of the application to the dataset described in Section 2. 

2 The dataset 

In order to illustrate the approach presented in this chapter, we analyze data which come from an 
Italian multi-centric clinical study. These data concern 275 oncological patients recruited from three 
different centres (Ancona, Perugia, and Messina). Patients were asked to fill some questionnaires 
about different latent characteristics; here, we consider HrQoL, anxiety, and depression. In particular, 
HrQoL is assessed by the "36-item Short-Form Health Survey" (SF-36) of Ware et al. [WAR 02], 
whereas anxiety and depression are assessed by the "Hospital Anxiety and Depression Scale" (HADS) 
of Zigmond and Snaith [ZIG 83]. The response rate is equal to 74% (203 patients out of 275). However, 
the sample of respondents is here assumed to be representative of the entire sample, since there are no 
significant differences between the distributions of age, gender, marital status, education, and cancer 
diagnosis; see Table 1 for a comparison between the population and sample distributions of these 
variables. 

SF-36 is a multidimensional test developed in the nineties to evaluate HrQoL during the last four 
weeks of illness; it has been validated in many different languages. The test consists of 36 polytomous 
items divided in the following 9 subsets (corresponding to different latent traits): 

1. PF: physical functioning (10 items); 

2. RF: role functioning, that is limitations in daily activities as a result due to physical health 
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Table 1: Entire and respondent sample distributions of age, gender, marital status, education, and 



cancer diagnosis (column percentages). 
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problems (4 items); 

3. BP: bodily pain (2 items); 

4. GH: general health (5 items); 

5. VT: vitality (4 items); 

6. SF: social functioning (2 items); 

7. RE: role-emotional, that is limitations in daily activities as a result due to mental health problems 
(3 items); 

8. MH: mental health (5 items); 

9. HC: health change (1 item). 

The items have a different number of response categories; to simplify the illustration of the results, 
in the present study all the items are dichotomized, with category 1 indicating the presence of a 
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symptom or limitation (related to a low level of HrQoL), and category indicating its absence (related 
to a high level of the HrQoL). Table 2 shows, for each item, which original responses correspond to 
new category and which original responses correspond to new category 1. For the item coding used 
for the questionnaire SF-36 see [WAR 02]. 



Table 2: SF-36: original response categories and their dichotomization. 
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* Item labels are referred to the Italian 
version of SF-36. 



HADS is a questionnaire designed to assess anxiety and depression in patients with organic diseases, 
such as cancer. The questionnaire is composed by 14 items referred to two dimensions: 

1. anxiety (7 items); 

2. depression (7 items). 

All items have four response categories but have been dichotomized for our analysis, with category 1 
(corresponding to original categories 2 and 3) indicating the presence of anxiety (or depression), and 
category (corresponding to original categories and 1) indicating its absence. For the item coding 
used for the questionnaire HADS see [ZIG 83]. 

Note that, in both questionnaires, every item is assumed to measure only one latent trait, but 
these latent traits may be correlated. 
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3 The multidimensional latent class Rasch model 



In the following, we describe the model that we adopt for the analysis of HrQoL data. We recall 
that this is a version of the Rasch model which belongs the class of IRT models proposed in [BAR 
07] and that the main differences with respect to the classic Rasch model [RAS 61] are that (i) 
multidimensionality is assumed and (m) the latent traits are assumed to have a discrete distribution. 
To facilitate the illustration of the model, we make explicit reference to the application described in 
Section 2. 

3.1 Model assumptions 

Let n denote the number of subjects in the sample and suppose that they respond to k test items 
which measure D different latent traits or dimensions. Also let Id, d = 1,... ,D, be the subset of 
I = {1, ... ,k} containing the indices of the items measuring the latent trait of type d and let kd 
denotes the cardinality of this subset, so that k = Yld=i ^d- Since we assume that every item measures 
only one latent trait, the subsets Id are disjoint; obviously, these latent traits may be correlated. In 
our study, n = 203 and k = 50 (36 items from SF-36 and 14 items from HADS) and, on the basis of 
validation studies about these questionnaires, we know that the items measure D = 11 different latent 
traits: 9 related to HrQoL and 2 related to psychopathological disturbs. 
In this context, the classic Rasch model, based on the assumption 

io g it[p(Xi = i\ e = e)] = e-p i} i = i,...,k, (i) 

where 6 can be discrete or continuous, is extended in a multidimensional way on the basis of the 
following parameterization: 

D 

logit[ppQ = l | = 0)] =Y l (8idPd- Pi), i = l,-..,k. (2) 

d=l 

In the above expressions, Aj is the random variable corresponding to the response to item i; in partic- 
ular, Xi = 1 means the presence of the symptom or disturb detected by the item and Aj = means 
its absence. Moreover, in the unidimensional version, is the latent random variable corresponding 
to the trait of the interest and 9 denotes one of its possible values, whereas /3j is the difficulty of item 
i. For the multidimensional version, = (©i, . . . , &d)' is the vector of latent variables corresponding 
to the different traits measured by the test items, = (9±, . . . , 6d)' denotes one of its possible real- 
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izations, and 6^ is a dummy variable equal to 1 if item i belongs to 1^ (and then it measures the dth 
latent trait) and to otherwise. 

Note that, as already mentioned, the adopted model is based on a specific formulation of multidi- 
mensionality, in which each item measures only one latent trait; we are then assuming the so-called 
between-item multidimensionality. This is different from the so-called within-item multidimensionality, 
in which each item measures more than one latent trait (see Adams et al. [ADA 97] for a detailed 
description). Another important point is that the unidimensional formulation based on (1) is a special 
case of (2). In particular, one of the possible situations in which formulation (2) specialises into (1) is 
when the possible ability vectors have elements equal each other. 

In order to clarify the above point, consider a simple example involving only two latent traits, so 
that = (0i, 62)', which may assume the values represented in Figure 1. 

(a) Unidimensional case (b) Bidimensional case 



CD 



CD 



Figure 1: Plots of (9\, 62) possible values. 

We observe that panel (a) corresponds to the situation of unidimensionality, because all points 
are aligned on the bisector. In other words, one latent trait coincides with the other and the actual 
number of dimensions is one. On the contrary, in panel (b) the points are not on a bisector and there is 
not a complete ordering of them; in this case it is not possible to completely determine one latent trait 
from the other and we can consider the two latent traits as corresponding to two distinct dimensions. 

In order to complete the model specification, we need to assume a distribution for the vector 0, 
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since we adopt a random-effects rather than a fixed-effects approach. In the latter, a distinct vector 
of parameters would be estimated for each subject, but this would not allow certain analyses which 
are of interest in the present context. 

In a standard formulation, a continuous distribution vr(0), such as the normal distribution, is 
adopted. In such a case, the manifest distribution of the full response vector, X = (X\, . . . ,Xk)', is 
given by 

p(x) = p(X = x) = / p(x\e)ir(0)de, (3) 
Je 

where, because of the assumption of local independence, 

k 

p(x|0) = p(X = x|0 = 6) = Y[p(Xi = xi | © = 0). (4) 

i=\ 

Note that, because of the between- item multidimensionality assumption, we assume local independence 
separately for each dimension and therefore 

k D kd D kd 

l[p(x i = x i \® = e) = l[]\ P (x dh = x dh \@ = e) = l[]J P (x dh = x dh | Q d = e d ), (5) 

i=l d =lh=l d =lh=l 

where X d h denotes the response variable for the h-th. item measuring dimension d. Note that the 
notation Xdh is equivalent to the previous one, in the sense that each Xi coincides with a certain Xdh 
for suitable d and h. However, in this way we make explicit the dimension measured by this item. 

It is worth recalling that the theory about the Rasch model does not require the latent trait of 
interest to have a continuous distribution. Moreover, assuming a discrete distribution may have some 
advantages. Among others, it can lead to more parsimonious models in given situations, it can be used 
to test the additivity assumption of the Rasch model, and to construct empirical Bayes estimators of 
abilities [LIN 91]. We adopted this alternative approach and, therefore, we assume that the random 
vector has a discrete distribution with support {<^ 1 , . . . , Ccl- ^ n other words, we assume that the 
population is composed by C latent classes or subpopulations. We outline that the number of latent 
classes is the same for each dimension. In theory, it should also be possible to assume a more general 
model with a latent class structure that differs between dimensions, but in this case the interpretability 
of parameters is not straightforward. As for the conventional LC model (see [LAZ 68]; [GOO 74]), 
the manifest distribution of the full response vector is given by 

c 

=p(X = x) = ^p(x|C c )vr c , (6) 

c=l 
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where ir c = p(& = £ c ) is the weight of the cth latent class, that is the probability that a subject 
belongs to this latent class. Note that, according to equation (4), p(x|£ c ) is defined as 

D k d 

p(x|C c )=p(X = x|0 = O = II JIp(Xdh = Xdh \6d = (cd), c=l,...,C. (7) 

d=l h=l 

The specification of the multidimensional LC Rasch model, based on the assumptions illustrated 
above, univocally depends on the number of latent classes (C) and on how items are associated to the 
different dimensions (i.e., the subsets Z^, d = 1, . . . , D). Maximum likelihood estimation of this model 
is illustrated in the following. 

3.2 Maximum likelihood estimation and model selection 

Assuming that the subjects in the sample are independent, the log-likelihood of the model is 

%) = $>(x)log[p(x)], (8) 

X 

where r\ is the vector containing all the parameters, n(x) is the frequency of the response pattern x, 
and the probability p(x) is defined as in equation (6) depending on r\. Moreover, the sum ranges 
over all the possible configurations of the response vector. 

In order to maximize l(rj), and then obtaining the (discrete) MML estimate of r], we make use of 
the EM algorithm [DEM 77], which is an iterative algorithm based on alternating two steps (E and 
M). The expectation (E) step consists of computing the expected value of the complete log-likelihood 
evaluated at the current value of the parameters. The maximization (M) step consists of updating 
these parameters by maximizing the expected log-likelihood found at the E step. These parameters are 
then used to determine the distribution of the latent variables at the next E step. The two steps are 
performed until convergence. See [BAR 07] for a detailed description of this estimation algorithm and 
its initialization. At this stage we only stress that, in order to prevent the problem of the multimodality 
of the model likelihood, it is advisable to try different starting values chosen by both deterministic 
and random rules. We denote by i) the MML estimate of r], that is the value that, at convergence 
of EM algorithm, corresponds to the highest log-likelihood. Similar notation is adopted for the single 
parameters. 

Once the model is fitted, we can allocate each subject to one of the latent classes. The allocation 
depends on the specific item response pattern x provided by the subject. In particular, for each 
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pattern x it is possible to estimate the posterior probabilities of belonging to latent class c as follows 

p(C c |x)=p(e = c c |x = x)= p(x j Cc ^ c , c = i,...,c. (9) 

An individual is assigned to the latent class with the highest posterior probability. 

An important phase connected with the estimation process is represented by the model selection, 
with special attention to the choice of the number of latent classes (C). A first approach [KIE 56] 
would suggest to use the value of C corresponding to the saturation point beyond which the likelihood 
of the assumed model fails to increase. For the unidimensional LC Rasch model, the existence of this 
saturation point was established by Lindsay et al. [LIN 91]. However, in our context, this number of 
classes could be so large that the model becomes non-identifiable or almost non-identifiable. To avoid 
this, several authors suggest to use information criteria, which are typically based on penalizing the 
log-likelihood by a factor that takes into account the number of parameters as a measure of the model 
complexity. 

One of the most known information criteria is represented by the Bayesian information criterion 
(BIC) of Schwarz [SCH 78], which is based on the index 

BIC = -2£ + g ln(n), 

where I is the maximum value of the log-likelihood and g is the number of non-redundant parameters. 
For the multidimensional LC Rasch model, in particular, we have 

g = (C-l)+DC + {k-D), (10) 

since there are C — 1 mass probabilities for the classes, DC ability parameters, and k — D difficulty 
parameters (we have to consider that some of them are constrained to in order to ensure model 
identifiability) . Among different models, the one with the smallest value of the BIC index has to be 
preferred. For an illustration in the context of mixture models, which is strongly related to the present 
one, see McLachlan and Peel [MCL 00], Ch. 6. 

3.3 Software details 

In our application, we used the software accompanying the paper [BAR 07] to estimate the model and 
choose the number of classes. This software can be downloaded from the web page 
http://www.stat.unipg.it/bartolucci and is quite simple to use for practitioners. It is mainly 
based on a MATLAB function, named lcest.m, which requires the following inputs: 
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• a matrix with each row corresponding to a different configuration of item responses and each 
column corresponding to an item (responses are coded as and 1 and are suitably separated); 

• a vector with the frequencies of the observed response configurations; 

• the number of latent classes; 

• the type of starting values (deterministic or random); 

• the type of model (LC model, multidimensional LC Rasch model, multidimesional LC two- 
parameter logistic model); 

• a matrix indicating the multidimensional structure of the set of items (any row contains the set 
of items referred to the same dimension). 

The following output is obtained from lcest .m for the estimated model: 

• maximum log-likelihood; 

• estimated probabilities of the latent classes; 

• BIC index; 

• estimated difficulty parameters (and discriminating indices for the LC two-parameter logistic 
model); 

• estimated support points for each latent class and for each latent dimension; 

• estimated probability of responding 1 to every item for each latent class; 

• estimated posterior probability of belonging to every latent class for each response configuration. 

Function lcest .m also allows us to select the optimal number of latent classes. We have to estimate 
a multidimensional LC Rasch model with an increasing number of latent classes and the best model 
is chosen on the basis of a given selection criterion, such as BIC. In a similar way we can estimate the 
multidimensional LC Rasch model with different multidimensional structures in order to choose the 
optimal number of dimensions and to evaluate the possibility of collapsing two dimensions into a single 
one (see Section 4 for theoretical details on tests of dimensionality and Section 5 for an application). 
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Other outputs, such as the correlations between latent dimensions or between every item and the 
corresponding dimension, cannot be obtained directly by lcest.m, but a little extra programming is 
sufficient. 

3.4 Concluding remarks about the model 

It is already clear that a crucial assumption of the multidimensional LC Rasch model described above 
is that of discreetness of the latent trait distribution. Alternatively, we could assume that the vector 
follows a multivariate normal distribution of dimension D. However, if the variance-covariance 
matrix of this distribution is unconstrained, computing the manifest distribution of X would require 
complex numerical tools when D is greater than 2, as in our application, because a high dimensional 
integral is involved (see equation (3)). This has obvious implications on the parameter estimation and 
on the possibility of making a comparison with our model, which is based on a discrete latent trait 
distribution. 

On the other hand, the model based on a multivariate normal distribution for the latent traits 
would be easily estimable under the constraint that the variance-covariance matrix is diagonal, so that 
the latent traits are independent. In fact, estimating this model is equivalent to estimating a separate 
unidimensional normal Rasch model for each dimension. The same holds for the version based on a 
discrete latent trait distribution. It is worth noting that, with the same number of latent classes (C), 
the latter model involves more non-redundant parameters than the multidimensional LC Rasch model 
that we propose to use. In fact, this model has D(C — 1) + DC + (k — D) parameters which is larger 
than the number of parameters given in (10). 

Though estimating separate unidimensional Rasch models does not allow us to study the correlation 
between the latent traits, this may be useful for assessing whether the assumption of discreteness of 
the latent trait distribution is suitable or not for the data at hand. This amount to comparing the 
global fit of these models under this assumption with that obtained under the assumption of normality. 
Such comparison may be based on standard criteria, such as BIC. 

Finally, we have to remark that the multidimensional LC Rasch model may be easily extended to 
deal with polytomous items and incomplete data. Concerning the first issue, the extension would be 
based on adopting a suitable parameterization of the conditional response distribution given the latent 
traits, which extends that in (2). At this regard, we can use one of the parametrizations illustrated 
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by Samejima [SAM 96] in her review of IRT models for polytomous data. About incomplete data, we 
have to clarify that the model and the estimation algorithm can be easily extended to treat the case 
of incomplete responses by design, because it is sufficient to drop, for each subject, the items to which 
the subject does not respond. The extension to the case of informative missing responses is not so 
obvious. 

4 Inference on the correlation between latent traits 

In our context, the main interest is on the study of the correlation between the latent traits corre- 
sponding to the different dimensions. 

First of all, for two dimensions d\ and efo (e.g., BP and SF), the correlation may be measured 
through the index 

C 

Pd 1 d 2 =Yl Cdi Cd 2 %c, (11) 
c=l 

where c = 1,...,C, d = 1, . . . , D, is the standardised estimate of the latent trait level referred 
to dimension d for the subjects in latent class c. In practice, this is the dth element of £ c , once the 
average latent trait level has been subtracted from each element and this difference has been divided 
by the standard deviation. Moreover, tt c is the estimate of the weight of the c-th latent class. 

A crucial point is how to test the hypothesis that two dimensions d\ and efo are perfectly correlated. 
This hypothesis is strongly related to the hypothesis that the items in 1^ and measure the same 
latent trait. Given the parameterization in (2), the latter may be expressed as Hq : £ c ^ 2 = ^ 1 + a, 
with c = 1,...,C, where a is an arbitrary constant. In the following, we deal with two different 
approaches to test Hq. 

The first approach is directly based on the LR statistic between the multidimensional latent class 
Rasch model, in which a separate latent variable is used to represent each dimension, and a restricted 
version of this model in which dimensions d\ and d<i are collapsed into a single one. Both models 
are estimated by the discrete MML method, and then by the EM algorithm illustrated at the end of 
Section 3. From this estimation we obtain the maximum log-likelihood of the general model and 
that of the restricted model (£q). It has to be clear that both models are based on the same number 
of latent classes; they only differ in the number of latent variables and in how items are allocated to 
these variables. On the basis of these likelihoods, we obtain the following LR test statistic for the null 
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hypothesis H : 

LR 1 = -2(£ -£ 1 ). (12) 

Under Hq, LR\ has an asymptotic chi-square distribution with C — 1 degrees of freedom, where C 
is the selected number of latent classes. We reject Hq for high values of LR±, that is values larger 
than a suitable percentile of the asymptotic distribution; otherwise, we do not reject this hypothesis, 
implying that the two dimensions are indeed collapsible. In order to measure the evidence provided 
by the data in favour of Hq, we can also compute a p- value, as the value of the survival function of 
the asymptotic distribution at LR\. 

One of the main advantages of the above LR test is that, if properly extended, it can be also used 
when items discriminate differently among subjects (i.e., when the Rasch paradigm does not hold); 
however, this aspect is beyond the purposes of the present chapter. On the other hand, this approach 
requires the choice of the number of latent classes and the results may depend on this choice. For this 
reason we also consider a second approach for testing if two sets of items measure the same dimension, 
which does not require to formulate any assumption on the distribution of the latent variables. This 
approach is based on the ML test [GL A 95b] . See Martin-L6f [MAR 70] for the original version of the 
test for dichotomous items split in two dimensions and Christensen et al. [CHR 02] for a generalization 
to polytomous items and to situations with more than two dimensions. It is worth mentioning that, 
in the psychometric literature, alternative tests are available which may be more powerful in certain 
situations. We refer, in particular, to the class of one degree of freedom tests proposed by Verhelst 
[VER 01]. 

ML test is again an LR test between two models. The main difference with respect to the test 
based on statistic LR\ is that the maximum log-likelihoods of the two models under comparison are 
obtained from the CML estimates (see [RAS 61]; [AND 70]; [AND 72]). The maximum log-likelihood 
of the general model obtained in this way is denoted by £i, whereas that of the restricted model is 
denoted by £q. The resulting LR test statistic is then 

LR 2 = -2(4-4). (13) 
In particular, the first log-likelihood is obtained as 

Il=f£ +£ { S +tlm, (14) 
where 1^ is the maximum conditional log-likelihood for the items in 1^, 1^} is the maximum con- 
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ditional log-likelihood for the items in , both obtained through the CML method, and t\ m is the 
maximum marginal log-likelihood of the multinomial model for the distribution of the scores. The 
latter may be expressed as 

^d 2 

hm= Y n rir2 log(n rir2 /n), (15) 

ri=0 r2=0 

where r\ is the test score for the items in the subset (e.g., PF), r2 is the test score for the items 
in the subset Zd 2 (e.g., RF), and n rir2 is the frequency of subjects with scores r\ and T2 at the two 
subsets; see also [VER 01]. We recall that the test score is the number of items that are responded by 
category 1 and that kfa and k& 2 are the number of items from the first dimension (e.g., PF) and from 
the second dimension (e.g., RF), respectively. 

A decomposition similar to that in (14) holds for £q. We have 

to=$Oc + iom, (16) 

where 1§ C is the maximum conditional log-likelihood for the items in U X^ 2 , and 

kd l +kd 2 

4m = n r \og(n r /n). (17) 

r=0 

In the above expression, r is the test score achieved at the items in both subsets, the frequency of 
which is denoted by n r . Under Hq, the test-statistic Li? 2 has an asymptotic chi-square distribution 
with kd 2 — 1 degrees of freedom. 

Though the ML test has the advantage of not requiring the choice of the number of latent classes 
neither the formulation of distribution assumptions on the latent variables, Ver heist [VER 01] and 
Christensen et al. [CHR 02] found that its power is significantly affected by the length of the question- 
naire and it may be disappointingly low with strongly correlated latent dimensions. More precisely, it 
can be verified that the null distribution may deviate from the asymptotic chi-square distribution be- 
cause of the large number of items in comparison with the sample size. In the psychometric literature 
there are several proposals to solve this problem. For example, Christensen and Kreiner [CHR 07] 
proposed a Monte Carlo test procedure for computing p- values. Moreover, another problem connected 
with the ML test concerns the fact that this test can only be used when the Rasch paradigm holds, 
since otherwise the CML method cannot be applied. 

In order to take into account advantages and disadvantages of the two tests described in this 
section, in our empirical study we apply both. The LR test based on the discrete MML is calculated 
by means of a set of Matlab functions which are available upon request. The ML test is performed 
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by means of STATA as concerning the computation of the maximum conditional log-likelihoods (£o c , 
Ac ' ^lc )' an< ^ through a suitable Matlab function as concerning the marginal counterparts (l~o m , 
£i m ). The same methods could be applied by alternative statistical softwares, such as MULTIRA 
of Carstensen and Rost [CAR 01], the %pml SAS macro of Christensen and Bjorner [CHR 03] or 
DIGRAM of Kreiner [KRE 03]. 

Finally, we have to underline that the above methodology gives valid results provided that the 
Rasch paradigm holds for each dimension. In order to check that this is the case, we have to fit 
separate Rasch models for each of these dimensions, and those that violate this paradigm should not 
be included in the analysis. For this purpose, several testing procedures are available in the literature 
(see, among others, [LIN 94] and [GLA 95a]). These procedures are mainly based on the comparison 
with the two-parameter logistic model and with the saturated model by chi-square or LR statistics, 
and on infit and outfit statistics. 

5 Application results 

First of all, with reference to the data described in Section 2, we verified that the Rasch paradigm 
clearly holds for only five dimensions; we recall that each dimension corresponds to a subset of items 
measuring a specific aspect and these subsets are defined by the structure of the questionnaires. Three 
of these dimensions are referred to HrQoL, that is BP, SF, and VT, whereas the other two are referred 
to psychopathological disturbs, that is anxiety and depression. Therefore, we based our analysis on 
the responses to the items referred to these five dimensions, whereas the items referred to the other 
ones were discarded. Overall, the data we analysed concern 22 items. 

The first step of the analysis was focused on verifying if a discrete latent structure, on which the 
LC Rasch model is based, is indeed more suitable for the data at hand with respect to a continuous 
latent structure based on the normal distribution. As explained in Section 3.4, we cannot directly 
compare the multidimensional LC Rasch model with its normal counterpart because of the difficulties 
in estimating the latter. However, we can indirectly perform this comparison by estimating separate 
undimensional (LC and normal) Rasch models for each dimension. The results of this comparison are 
reported in Table 3 in terms of maximum log-likelihood and BIC index for each model. 

We observe that the assumption that the distribution of the latent traits is discrete has to be pre- 
ferred to the assumption that this distribution is normal. In fact, for each dimension, the unidimen- 
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Table 3: Maximum log-likelihood and BIC index for the unidimensional Rasch model fitted for each 
dimension by the MML method under the assumption that the latent trait has a normal distribution 
and under the assumption that is has discrete distribution with a number of support points (latent 
classes) between 1 and 7; for each dimension, in bold are the data referred to the model with the 



smallest BIC. 







normal 




LC Rasch model - 


number of latent classes (C) 




Dimension 


Rasch model 


1 


2 


3 


4 


5 


6 


7 


BP 


I 

BIC 


-193.4 
402.7 


-206.9 
424.4 


-190.5 
402.2 


-190.5 
412.8 


-190.5 
423.5 


-190.5 
434.1 


-190.5 
444.8 


-190.5 
455.4 


SF 


t 

BIC 


-157.8 
331.5 


-163.1 
336.9 


-147.2 
315.7 


-147.2 
326.4 


-147.2 
337.0 


-147.2 
347.6 


-147.2 
358.3 


-147.2 
368.9 


VT 


1 

BIC 


-495.1 
1016.7 


-511.0 
1043.2 


-454.6 
941.2 


-446.8 
936.2 


-446.8 
946.8 


-446.8 
957.5 


-446.8 
968.1 


-446.8 
978.8 


Anxiety 


i 

BIC 


-671.6 
1385.8 


-753.5 
1544.3 


-653.4 
1354.8 


-642.7 
1343.9 


-641.0 
1351.1 


-641.0 
1361.8 


-641.0 
1372.4 


-641.0 
1383.0 


Depression 


I 

BIC 


-636.5 
1315.5 


-701.6 
1440.4 


-607.7 
1263.2 


-602.0 
1262.4 


-602.0 
1273.0 


-602.0 
1283.6 


-602.0 
1294.2 


-602.0 
1304.9 


Overall 


t 

BIC 


-2154.4 
4452.2 


-2336.1 
4789.2 


-2053.5 
4277.1 


-2029.2 
4281.7 


-2027.5 
4331.4 


-2027.5 
4384.7 


-2027.5 
4437.7 


-2027.5 
4491.0 



sional LC Rasch model always attains a higher maximum log-likelihood than its normal counterpart, 
while the minimum BIC index is always smaller. The same happens at global level (see the last two 
rows of Table 3). 

Then, we fitted the multidimensional LC Rasch model with an increasing number of latent classes. 
To this regard, in Table 4 we report the values of the maximum log- likelihood and of the BIC index 
obtained with a number of latent classes (C) from 1 to 7. 



Table 4: Maximum log-likelihood and BIC index for the multidimensional latent class Rasch model 
with a number of latent classes between 1 and 7; in bold are the data referred to the model with the 
smallest BIC. 









Number 


of latent classes (C) 








1 


2 


3 


4 5 


6 


7 


I 

BIC 


-2321.8 
4760.6 


-2066.7 
4282.1 


-2016.3 
4213.2 


-1989.8 -1966.9 
4192.1 4178.3 


-1949.3 
4175.0 


-1937.7 
4183.7 



According to the adopted selection criterion, we chose C = 6 latent classes corresponding to 
BIC = 4, 175.0. We observe that this model has a better fit than the model which is equivalent to 
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separate unidimensional LC Rasch models. For the latter we had 4,277.1 as minimum value of the 
BIC index; this value is reached with 2 latent classes (see Table 3). For the multidimensional LC 
Rasch model with C = 6 latent classes, we obtained the parameter estimates (support points and 
class weights) displayed in Table 5. In particular, we estimated 5 support points (corresponding to 
the number of dimensions) for each of the 6 latent classes. Due to the adopted parameterization, see 
equation (2), very low values of these support points correspond to a negligible probability of response 
1, whereas very high values correspond to a negligible probability of response 0. To facilitate the 
interpretation of the results, the latent classes are ordered on the basis of the conditional probability 
of response 1 for the first dimension (BP). 

Table 5: Estimated support points and probabilities of the latent classes (for each dimension, in bold 
the highest support point, in italic the smallest one). 









Latent class 






Dimension 


1 


2 


3 


4 


5 


6 


BP 


— oo 


-2.17 


-0.92 


-0.91 


-0.20 


1.48 


SF 


-2.56 


— oo 


-1.71 


-2.92 


— oo 


0.79 


VT 


-2.36 


-3.28 


-0.46 


1.09 


-2.23 


0.93 


Anxiety 


0.00 


-4.87 


-3.40 


-1.85 


-1.55 


-0.05 


Depression 


1.30 


-4.10 


-1.99 


0.12 


-1.50 


0.76 


Probability 


0.042 


0.223 


0.383 


0.134 


0.131 


0.086 



A crucial point of the present study is the interpretation of the latent classes, which define homo- 
geneous groups in the population. This interpretation is not always easy, but some useful suggestions 
come from the estimates of the support points and class probabilities. In particular, the smallest classes 
are the first (4.2% of subjects) and the last (8.6% of subjects). These classes contain, respectively, 
patients with the highest tendency to psychopathological disturbs and with the highest impairment 
with respect to BP and SF. The second class contains more subjects (22.3%); these subjects are in the 
best conditions with respect all dimensions, with the exception of BP. The third class is the largest 
(38.3% subjects) and includes subjects with an intermediate tendency to be ill with respect to each 
dimension. Finally, the fourth and fifth latent classes have a very similar size (13.4% and 13.1% of 
subjects) and include, respectively, subjects with the worst conditions with respect to VT and the 
best conditions with respect to SF. Note that individuals belonging to the second and the fifth latent 
class show a very good level with respect to SF. 

As already mentioned, it is important to analyze the correlation between the different dimensions 
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of HrQoL, and the correlation with anxiety and depression. At this aim, in Table 6 we report the 
correlation coefficients estimated on the basis of equation (11) for every pair of dimensions. 

Table 6: Estimated correlation matrix between latent traits. 





SF 


VT 


Anxiety 


Depression 


BP 


0.679 


0.794 


0.932 


0.989 


SF 




0.934 


0.388 


0.596 


VT 






0.530 


0.707 


Anxiety 








0.971 



As regards the dimensions of HrQoL, we observe that VT is strongly correlated especially with the 
other psychological dimension SF {p = 0.934). As regards the association between the dimensions of 
HrQoL measured by the SF-36 questionnaire and the psychopathological disorders measured by the 
HADS questionnaire, both anxiety and depression show a very high correlation with BP (p = 0.932 
and p = 0.989, respectively). On the other hand, the association of anxiety with SF and VT is much 
weaker (p = 0.388 and p = 0.530, respectively), and the association of depression with SF and VT 
assumes intermediate values (p = 0.596 and p = 0.707, respectively). Finally, as we may expect, the 
correlation between anxiety and depression is very high (p = 0.971). 

When the correlation between two latent traits is high, it is interesting to verify if the corresponding 
items actually measure the same dimension. At this aim, we performed both tests described in the 
previous section, that is the LR tests based on the discrete MML and the CML methods, for every 
pair of dimensions. LR statistics and the corresponding p- values are shown in Table 7 for the first test 
and in Table 8 for the second one. 

Table 7: Results from the test of dimensionality based on the discrete MML parameter estimates; in 
brackets the p-values. 





SF 


VT 


Anxiety 


Depression 


BP 


27.1 (0.000) 


14.0 (0.007) 


22.2 (0.000) 


27.5 (0.000) 


SF 




12.5 (0.014) 


32.1 (0.000) 


26.3 (0.000) 


VT 






51.2 (0.000) 


70.3 (0.000) 


Anxiety 








9.1 (0.059) 



We observe that the values of the test statistic based on discrete MML method are almost always 
smaller than the corresponding values computed on the basis of the CML method, but the main 
conclusions are similar. In fact, both test statistics have a null asymptotic chi-square distribution, 
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Table 8: Results from the test of dimensionality based on the CML parameter estimates (ML test); in 
brackets the p-values. 





SF 


VT 


Anxiety 


Depression 


BP 


23.0 (0.000) 


38.6 (0.000) 


53.7 (0.000) 


57.0 (0.000) 


SF 




30.7 (0.000) 


42.9 (0.000) 


36.3 (0.001) 


VT 






126.7 (0.000) 


112.7 (0.000) 


Anxiety 








83.3 (0.001) 



but the second is based on a larger number of degree of freedom. Nevertheless, both kinds of test do 
not show any evidence of unidimensionality. A weak evidence of unidimensionality appears under the 
test based on the discrete MML estimates for anxiety and depression (p-value equal to 0.059) and for 
VT and SF (p- value equal to 0.014). Then, the present study confirms that the dimensions defined by 
the SF-36 and HADS questionnaires are indeed separate, and collapsing some of them would imply 
an inappropriate simplification of the phenomena investigated by the test items. 
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